Preliminary Chinese Term Classification for Ontology Construction
Abstract
An ontology can be seen as a representation of concepts in a specific domain. Accordingly, ontology construction can be regarded as the process of organizing these concepts. If the terms used to label the concepts are classified before an ontology is built, the construction work can proceed much more easily. Part-of-speech (PoS) tags usually carry linguistic information about terms, so PoS tagging can be seen as a preliminary classification that helps in constructing concept nodes in an ontology, because the features or attributes related to concepts of different PoS types may differ. This paper presents a simple approach to tagging domain terms for the convenience of ontology construction, referred to as Term PoS (TPoS) Tagging. The proposed approach uses the segmentation and tagging results from a general-purpose PoS tagger to predict tags for extracted domain-specific terms. It needs no training and no context information. The experimental results show that the proposed approach achieves a precision of 95.41% for extracted terms and can be easily applied to different domains. Compared with some existing approaches, our approach shows that for some specific tasks a simple method can obtain very good performance and is thus a better choice.
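The idea of predicting a term's tag from a general tagger's output can be illustrated with a minimal sketch. The abstract does not state the actual prediction rule, so the rule below (take the tag of the last segment, on the assumption that Chinese compound terms are usually head-final) is only an illustrative assumption, not the paper's method. The function name `predict_term_tag`, the tag labels, and the sample tagger output are all hypothetical.

```python
from typing import List, Tuple

# One segment of a term as a general-purpose tagger might return it:
# (token, PoS tag). The tag labels used here are hypothetical.
Segment = Tuple[str, str]


def predict_term_tag(segments: List[Segment], default: str = "n") -> str:
    """Predict a single PoS tag for a multi-segment domain term.

    Assumed heuristic (not taken from the paper): use the tag of the
    final segment, treating it as the head of the compound. Falls back
    to `default` when the tagger returned no segments.
    """
    if not segments:
        return default
    return segments[-1][1]


# Hypothetical tagger output for the term "操作系统" (operating system),
# segmented as a verb followed by a noun.
tagged_term = [("操作", "v"), ("系统", "n")]
term_tag = predict_term_tag(tagged_term)
```

Like the approach described in the abstract, this kind of rule needs no training data and no sentence context; it only consumes the per-segment tags of the term itself.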
Similar Resources
Chinese Core Ontology Construction from a Bilingual Term Bank
A core ontology is a mid-level ontology which bridges the gap between an upper ontology and a domain ontology. Automatic Chinese core ontology construction can help quickly model domain knowledge. A graph-based core ontology construction algorithm (COCA) is proposed to automatically construct a core ontology from an English-Chinese bilingual term bank. This algorithm computes the mapping streng...
The Design and Implementation of Chinese Semantic Search Engine Based on FAQ Corpus and Ontology Construction from Information Extraction
Wen-Chih Chen, Lu-Ping Chang and Shi-Jim Yen. Advanced e-Commerce Technology Lab., Institute for Information Industry, ROC; National DongHwa University, Taiwan. E-mail: {wjchen, clp}@iii.org.tw. Abstract: In the paper, we propose FAQ corpus and ontology construction to implement a Chinese semantic search engine. These frequentl...
When Conset Meets Synset: A Preliminary Survey of an Ontological Lexical Resource Based on Chinese Characters
This paper describes an ongoing project concerned with an ontological lexical resource based on the abundant conceptual information grounded in Chinese characters. The ultimate goal of this project is to construct a cognitively sound and computationally effective character-grounded machine-understandable resource. Philosophically, the Chinese ideogram has its ontological status, but its appli...
An Ontology-Based Method for Extracting and Classifying Domain-Specific Compositional Nominal Compounds
In this paper, we present our preliminary study on an ontology-based method to extract and classify compositional nominal compounds in specific domains of knowledge. This method is based on the assumption that, by applying a conceptual model to represent a knowledge domain, it is possible to improve the extraction and classification of lexicon occurrences for that domain in a semi-automatic way. We ...
A Methodology for Domain Ontology Construction Based on Chinese Technology Documents
Ontology is considered to play one of the most important roles in knowledge sharing and reuse. However, how to effectively construct a Chinese domain ontology is a difficult problem. This paper proposes a pattern-learning approach to Chinese domain ontology construction, based on the fixed and simple character of Chinese syntactic patterns in technology documents. The first step of this method i...